Mosaicing-by-recognition for recognizing texts captured in multiple video frames
نویسندگان
چکیده
Text recognition in video frames is promising because of its following superiorities over text recognition in a still camera image: (1) it is possible to recognize longer texts by concatenating the frames, and (2) it is also possible to improve the quality of the text image by integrating the frames. In this paper, a mosaicing-by-recognition technique is proposed where video mosaicing and text recognition are simultaneously and collaboratively performed in a one-step manner by a dynamic programming-based optimization algorithm. In this optimization algorithm, rotation, scaling, vertical shift, and speed fluctuation of camera motion are efficiently compensated. The results of experiments to evaluate not only the accuracy of text recognition but also that of video mosaicing indicates that the proposed technique is practical and can provide reasonable results in most cases.
منابع مشابه
Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کاملDetection and Recognition of Multi-language Traffic Sign Context by Intelligent Driver Assistance Systems
Design of a new intelligent driver assistance system based on traffic sign detection with Persian context is concerned in this paper. The primary aim of this system is to increase the precision of drivers in choosing their path with regard to traffic signs. To achieve this goal, a new framework that implements fuzzy logic was used to detect traffic signs in videos captured along a highway f...
متن کاملRobust Video Mosaicing through Topology Inference and Local to Global Alignment
The problem of piecing together individual frames in a video sequence to create seamless panoramas (video mosaics) has attracted increasing attention in recent times. One challenge in this domain has been to rapidly and automatically create high quality seamless mosaics using inexpensive cameras and relatively free hand motions. In order to capture a wide angle scene using a video sequence of r...
متن کاملImage Mosaicing using first order Moments and Pixel to Pixel Comparison
In real-life situations, the field of view of the imaging devices is usually smaller than the scene to be imaged, due to the inherent limitations of the imaging devices. Even if the whole scene is captured in a single exposure, it could result in an image having poor resolution. In such circumstances, it will never be possible to capture the scene as a single image for further processing or hum...
متن کاملVideo Mosaicing using Manifold Projection
Video mosaicing is commonly used to increase the visual eld of view by pasting together many video frames. Existing mosaicing methods are eeective only in very limited cases where the image motion is almost a uniform translation or the camera performs a pure pan. Forward camera motion or camera zoom are very problematic for traditional mosaicing. A mosaicing methodology to allow image mosaicing...
متن کامل